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Squeezed primordial non-Gaussianity can strongly constrain early-universe physics, but it can 
only be observed on the CMB after it has been gravitationally lensed. We give a new simple non- 
perturbative prescription for accurately calculating the effect of lensing on any squeezed primordial 
bispectrum shape, and test it with simulations. We give the generalization to polarization bispectra, 
and discuss the effect of lensing on the trispectrum. We explain why neglecting the lensing smooth- 
ing effect does not significantly bias estimators of local primordial non-Gaussianity, even though 
the change in shape can be > 10%. We also show how tnl trispectrum estimators can be well 
approximated by much simpler CMB temperature modulation estimators, and hence that there is 
potentially a ~ 10-30% bias due to very large-scale lensing modes, depending on the range of mod- 
ulation scales included. Including dipole sky modulations can halve the tnl error bar if kinematic 
effects can be subtracted using known properties of the CMB temperature dipole. Lensing effects on 
the <;nl trispectrum are small compared to the error bar. In appendices we give the general result 
for lensing of any primordial bispectrum, and show how any full-sky squeezed bispectrum can be 
decomposed into orthogonal modes of distinct angular dependence. 



I. INTRODUCTION 

A squeezed bispectrum or trispectrum produced by local primordial non-Gaussianity is observable in the CMB, 
and if observed would give a powerful way to rule out simple single-field inflation models and strongly constrain 
^ \ general properties of inflation. The local non-Gaussianity produced in the CMB can be thought of as a modulation 
of small-scale perturbations by large-scale modes, so over a large overdensity there will be more (or less) small-scale 
power than over an underdensity, depending on the sign of the non-Gaussianity. However if we observe the small- 
scale modes they will be gravitationally lensed, so in the squeezed limit we expect to see a modulation of the lensed 
small-scale power spectrum due to large-scale modes. This may be important because lensing smooths out acoustic 
structures, changing the detailed shape of the bispectrum and trispectrum. Previous work Ref. |lj has shown that 
* for the temperature bispectrum due only to local primordial non-Gaussianity, the bias due to this change in shape is 
very small. However since the shape is changed, accounting for lensing might be important to correctly identify the 
form of the non-Gaussianity; for example a different shape which is orthogonal to the unlensed bispectrum will not 
generally be orthogonal to the lensed bispectrum. In this paper we give new simple approximations for the effect of 
lensing on the squeezed CMB temperature and polarization bispectra which allow the effects to be calculated easily. 
We test these approximations against simulations, and quantify the importance of the lensing at different levels of 
primordial non-Gaussianity. 

Previous work has investigated the lensing bispectrum in detail, its potential bias on local non-Gaussianity estima- 
tors, and its impact on the variance of the primordial non-Gaussianity estimators The bispectrum produced 
by lensing turns out to be significant, corresponding to a projection of /nl ~ 9 onto the local shape, and should be 
detectable by Planck. However the effect is easily modelled and subtracted since the detailed shape of the lensing 
bispectrum is actually very different from the local shape . Here we address the different issue of how lensing affects 
any other primordial bispectrum that we might want to observe, and assess whether the change due to lensing is 
important. 

We then extend to trispectrum estimators, and estimate the bias on local tnl and c/nl trispectrum shape due to 
lensing. We show how this can be calculated easily using simple approximations for the CMB trispectra, and then 
also discuss the effect of lensing on any primordial local trispectrum. The lensing trispectrum is large, and can be 
detected at high significance with Planck, but we shall see that its distinctive shape and scale dependence is very 
different from local primordial non-Gaussianity. We show that although the projection of the lensing shape onto the 
local primordial shapes gives a signal that is much larger than would be expected in most inflationary models, it is 
still only a small-fraction of the expected observational error bar. 
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Aside from the detailed analysis of the lensing effects, we also give a few results of more general interest. In 
particular we give simple analytic forms for the CMB bispectra and trispectra that are quite accurate in the highly 
squeezed limit, and show how general squeezed CMB bispectra can be decomposed into modes of distinct angular 
dependence. We also demonstrate that tnl can be modelled very accurately as an angular modulation of the small- 
scale CMB temperature power, and hence can be estimated quickly and nearly optimally using statistical anisotropy 
estimators. This also clearly shows the importance of the dipole component of the modulation, and how it must be 
carefully distinguished from the kinematic dipolar modulation. 



II. LENSED SQUEEZED BISPECTRA 



Lensing deflection angles are only a few arcminutes, though coherent on degree scales. As such, lensing only has 
a large effect on relatively small scales. Local bispectra depend on three wave numbers L, 1 2 , I3 (we restrict to 
h < h < ^3 for convenience), and most of the signal is in squeezed triangles with l\ <C h,h- It is therefore a good 
approximation in many cases to take the largest-scale mode to be unlensed: T(li) ss T(\{). This approximation greatly 
simplifies many calculations with very little loss of accuracy, and also makes a non-perturbative analysis tractable, 
as shown for the CMB lensing bispectra in Rcf. [Q. For the moment we only consider temperature bispectra in the 
flat-sky approximation, and hence wish to calculate 

{T(h)f (1 2 )T (1 3 )) » {f(h)f (1 2 )T (la)) = ^-b hl2l3 S(h + h + 1 3 ), (1) 

where bi^i 3 is the reduced lensed bispectrum. The approximation was previously called the linear (unlensed) short- 
leg approximation. In Appendix [A] we give the general result for any shape, and show that the result for the lensed 
temperature bispectrum obtained from the linear short-leg approximation is correct to quadratic order in a squeezed 
expansion. 

We can now proceed to calculate the lensed bispectra, following the methods and notation used for calculating the 
lensed CMB power spectra via lensed correlation functions in Ref. fl , with 



T(l) = J — T(x + a)e- ll - x (2) 

where a tilde denotes the lensed field and a is the lensing deflection angle. Hence in the unlensed short-leg approxi- 
mation 

(T(h)f (l a )f (la)) = / ^^^^3( T (l 1 )T(l^)T(l! i ) e - i;1 - X2 e - i;1 -^ e ^-^ +a2 )e i;1 3-( x 3+«3) ) . (3) 
J Ztt Ztt Ztt Ztt 

The correlation between T(li) and the lensing potentials gives rise to the lensing bispectrum. We are not interested 
in this term here, and so only keep remaining terms where a can be taken to be uncorrelated to T . Hence 

(T{l x )f{h)f{h)) = j£yij ^^^ W ^-^-^{^e^) (4) 

where 1 3 — — li — 1' 2 and oti = a(xi). From statistical homogeneity (isotropy on the sky) the expectation value is only 
a function of r = x 2 — X3 , so integrating out x 2 + X3 we obtain 

(T(h)f (1 2 )T (la)) = -L^U + 1 2 + 1 3 ) J ^^b llV2l> /^-^{^^). (5) 

This is very similar in form to what is required for lensing of the temperature power spectrum |]-[tJ]. Let's define 
1' = (1' 2 — I 3 )/2 = 1 2 + li/2 and 1 = (1 2 — l 3 )/2 = 1 2 + li/2 to encode the wavevectors of the small-scale modes, so that 

1' 2 • a 2 + 1 3 ■ a 3 = 1' • (a 2 - a 3 ) - ^- • (a 2 + a 3 ). (6) 



Then neglecting non-Gaussianity of the lensing potentials, 



d 2 rd 2 l' ..„„, n ( 1 1 



= / bi.ini e ( ' exp 

J 2tt 2tt 123 P 

fd 2 rd 2 Y ir . {1 ,_ 1} / 1 



1' • (a. 2 - a 3 ) - • (oL 2 + a 3 ) 
I' 2 (a 2 (r)+cos2^, r C gh2 (r)) + | (C gl (0) + C gl (r) - cos 2^ ir C g i, 2 (r)) 



(7) 
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where cr 2 (r 32 ) = {(0:3 — ct2) 2 }/2, and C g i^{r),C g \(r) are denned as in Rcf. || (sec. 4.2). 

For squeezed shapes the second term in the exponential O(/ 2 C g i(0)) is very small (same order as things we've 
already neglected by using the unlensed short-leg approximation) and hence 

Khh « |^^6 w ^ e tr (1 '- 1) exp^-y[ CT 2 (r)+cos2^C gl , 2 (r)]^ . (8) 

If bji/'i' were a function only of l\ and |1'|, this could be evaluated trivially using exactly the same form as the result 
for lensing of the power spectrum. 

More generally we can parameterize the bispectrum in terms of h, I = II3 — I2I/2, instead of h, I2, h, where 4>iii 
is the angle between 1 and li. We can then expand the angular dependence of the bispectrum as 

bi M3 =Y, l hie mi * lh (9) 
in 

(see Ref. J§| for further discussion). From rotational invariance to should be even, and for parity- invariant fields the 
dependence on is only via l^zl, so we can equivalently write 

Khh = cos(m^ Hl ) (10) 

m 

where m is even and m > 0. 

Using the expansion of Eq. (^|) in Eq. (^), the angular integrals can then be done giving the lensed bispectrum 
moments in terms of integrals of modified and unmodified Bessel functions: 

Ki « / rdrJ m (lr) f dWh^v^^^' 2 E ^[Z' 2 C g L 2 (r)/2] J 2n+m [l'r). (11) 

This shows that lensing, which is on average a statistically isotropic process, does not mix the angular dependence 
of the squeezed bispectra: the lensed bispectrum 6™ ; depends only the unlensed bispectrum with the same to. For 
isotropic primordial bispectra in the squeezed limit the angular average is expected to dominate over other modes 
(unless the large-scale modes generate local anisotropy), so to that approximation one would be applying power 
spectrum lensing to angle-averaged bispectrum slices 6°; for each l\. Lensing of the 6 2 ; moments is mathematically 
identical to power spectrum lensing of the Cf E power spectrum. 

On the flat sky the Fisher correlation between two different bispectra is usually defined (for small signals) by || 



W) = 7T~2 / hdh / dl 2 'T^ i (12) 



^ J hdh J dh 6C h C h C l3 

which is zero if the bispectra are orthogonal. If the bispectra are both squeezed we can expand in terms of angular 
dependence, and obtain 

1 f r b m b' m * 

F(b, b') = -J hdh J Idl J2 ^2 (1 + 0{l\/l 2 )) . (13) 

As might be expected bispectrum components with to ^ to' are orthogonal in the squeezed limit. Since lensing 
does not change the angular dependence, this will remain true after lensing. If the correlation is defined without the 
power spectra in the denominator (or equivalently for constant white-noise power spectra, or using the bispectrum of 
whitened fields) this remains true for all triangles. 



A. Local non-Gaussianity 

The case of most immediate interest is when the primordial non-Gaussianity is of local (scalar) form, parameterized 
by /nl- In this form of non-Gaussianity small-scale modes are modulated by the large-scale scalar modes. In the 
squeezed limit, since the modulation is scalar, the modulation is expected to be isotropic, i.e. to = 0. Specifically the 
bispectrum is given by 

b(k u k 2 ,k 3 ) = 2^f NL [P(h)P(k 2 ) + P(h)P(k 3 ) + P(k 2 )P(k 3 )} 
5 

- 4/ NL P(fc!)P(fc) 

5 



9 + 15cos(20) 
16 



(14) 



4 




FIG. 1: The temperature CMB bispectrum from local non-Gaussianity (/nl = 1) with li — 30 (left) and h = 275 (right), 
projected into the isotropic component (solid lines), m — 2 quadrupole (green) and m = 4 (cyan) octopole parts. Thick lines 
are lensed using the approximation described in the text, thin lines show the unlensed bispectrum. The anisotropic components 
are small on super-horizon scales, and on smaller scales are relatively smooth so lensing has little effect. The definition of the 
angular components on the full sky is given in Appendix H. 



where in the second line we expanded in ki/k for a scale-invariant spectrum, </> is the angle between the long and 
the short-scale modes, and k = (k 3 — k 2 )/2. Thus in the squeezed limit there is no m dependence as expected, with 
leading corrections of 0((ki/k) 2 ) coming from the effect of gradients in the modulation. 

Evolution until last scattering will modify this for observations of the CMB. However for li corresponding to scales 
that are super- horizon at recombination (Zj <C 100), the corresponding perturbation can be taken to be constant 
across the last-scattering surfac e. The small-scale temperature is then modulated by 1 + (6/nl/5)C* (see more 



detailed discussion below in Sec. [ V A 1 ) , and hence the local model bispectrum is 



(T(1 1 )T(1 2 )T(1 3 )) « C^^-^(T(1 2 )T(1 3 ))^ (15) 
= ±-5(1! + h + l 3 )^NL< C *(C i2 + CI,), (16) 

where is the primordial curvature perturbation at last scattering, C*(n) = £(n, r*) and r* is the distance to last- 
scattering 1 . We can see immediately that on the lensed sky, where the lenses are taken to be uncorrelated to the 
C*, the same result is obtained simply replacing Ci with the lensed C;, and hence it should be no surprise that local 
bispectrum lensing is well modelled by power spectrum lensing. (Note that this low l\ limit is particularly interesting 
because non- linear evolution effects are under control analytically and known to be small ]l0|.) 

When l\ is larger the situation is more complicated since the modulation can no longer be approximated as being 
constant through last scattering. However the local bispectrum shape is still squeezed, so we can expect that our full 
result of Eq. (|ll|) will accurately describe the effect of lensing where only the lowest angular bispectrum moments are 
required. Numerically, using the fully-sky analogue of the angular decomposition described in the appendix, we find 
that the full sky temperature local bispectrum (calculated with CAMb) projected into only its isotropic part is ~ 96% 
correlated to the full result (Z max = 2000, no noise), and 99% correlated if both the m = and m = 2 (isotropic and 
quadrupolar) moments are retained. The quadrupolar moments are more important for larger (sub-horizon) l\ > 200. 
This is because the triangles with a given I have small-scale modes with transfer functions involving I2 and I3 that 



1 Note that Cf C * = ft(r») with ft(r) defined in Eq. (F3) 
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vary by ±h/2 depending on 0, the angle between li and 1; for l\ comparable to or larger than the separation of the 
acoustic peaks there is a significant variation in these transfer functions, giving a significant ra/0 component to the 
squeezed CMB bispcctrum for larger l\. See Fig. EL 

The unlensed short-leg approximation is expected to be quite accurate since almost all the signal to noise in 
local non-Gaussianity is at l\ < 500 for Planck sensitivity, where lensing effects are still small. We check these 
approximations numerically below by comparison with full-sky simulations. 



B. Polarization 



To consider polarization bispectra in the flat-sky approximation we need the lensed E and B modes given by the 
polarization analogue of Eq. (||). Taking the unlensed B modes to be zero, it follows from the definitions given in || 
that: 

E{\) = J ^^(lyi'-eM 1 '" 1 ) cos2<M (17) 



B(l) = J ^^(l'K 1 ''"^ 1 '- 1 ) sin 2^, , (18) 

and 4>i'i is the angle between 1' and 1. We can then use the lensed E and B modes to calculate the polarization 
bispectra combinations in the unlensed short-leg approximation as before: 

(X(h)Y(h)Z(h)) « (X(h)Y(h)Z(h)) = ^bfjjj(h +h + Is), (19) 

where X, Y and Z can be any of T, E and B. For parity invariance fields b TEB = b EEB = 0. 



1. XTE bispectra 



Following the same steps as for the temperature case, the first combination gives 

(X(h)f(h)E(h)) = J ^^^^{X^T^E^e-^-^ (20) 

Keeping terms where a is uncorrclated to T and E, and writing the expectation value as a function of r = x 2 — X3 
we have 

(X(h)f(l 2 )E(h)) = ^y5(ll +h + 1 3 ) J ^^ b UJ COs(^ Is )e i -0i-b) (e ^-a ae ^.a, ) (21) 

where I3' = — lj — I2. In the squeezed limit, expanding around li gives cos(2^y 3 ) w cos(2^/j) + 0{l\/l). Hence 
neglecting non-Gaussianity of lensing potentials and very small terms for squeezed shapes we have 

bfJI - / ^^ & wlf cos(2^ i )e 4r (I '- I) cxp (-Hi [a 2 (r) + cos 2^ r C gl , 2 (r)]^j . (22) 

Using the expansion of Eq. (^|), the angular integrals can be done as in the temperature case, giving a result in terms 
of modified and unmodified Bcsscl functions: 

l\Y E)m *>\frdrf d/'/'6|™™e-' 2 - 2 «/ 2 5:/„[/' 2 C g , 2W /2] 

J J n (23) 

(l'r)J m . 2 (lr)} . 

For the isotropic component b\ t the two terms give equal contributions, and the result is then mathematically 
the same as the result of the lensed Cf E power spectrum in terms of the unlensed spectrum (see e.g. Ref. Note 
however that in general m docs not have to be even in the case of the XTE bispectrum, since there is no symmetry 
between the two small-scale modes. Nonetheless we can expect the main lensing effect to be described in terms of the 
monopolc component, at least for low l\. 
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2. XEE bispectra 



In the case where the two small-scale modes are of the same type, there is a symmetry under interchange of I2 and 
I3, and the approximations become valid to 0{l\/l 2 ): following the previous argument 

(X(h)E(l 2 )E(l 3 )) = J ^^^^(X(I^(l 2 )£(l' 3 )e^ ■^ 2 <i' 2 -h) e ii' 3 - a3e i^' 3 -h) cos(2^ i3 ) cos(2^ i2 )) 
1 «5(1 1+ 1 2 + 1 3 ) f ^^^|| C os(2^ 3 )cos(2^ i2 )e-^- l2 )(e^^e^^). (24 ) 



(2tt) v 1 * ' °' J 2tt 2tt 

But now in the squeezed limit, expanding in li gives cos(20;' ; 3 ) cos(2^' ; 2 ) = cos 2 (20;'i) + 0{l\/l 2 ), and hence to 

odl/i 2 ) 

bfff 3 « / ^^^f| cos 2 (2^ i ) e - (1 '- 1) cxp ^[^( r ) + cos2^C7 gl , 2 (r)]j . (25) 

Using the expansion of Eq. (^), Eq. ( p5| ) can then be calculated in terms of integrals of modified and unmodified 
Bessel functions: 

[rdr [ dm™ m e~ l ' 2 ° 2 ^ 2 y I n [l> 2 C gl2 (r)/2] 

AJ J n (26) 

x [2J 2n + m (l'r)J m (lr) + J 2n+m+ i{l'r)J m+i {lr) + J2n+m-4,(l'r)J m -4,(lr)] . 

Again the isotropic component ° is given as expected in terms of the unlensed b[^ E ° in exactly the same 

way as the Cf E power spectrum is lensed. 



3. XBB bispectra 



The b bispectrum gives a similar result to b , but we assume it is generated purely by lensing of a primordial 
b XEE bispectrum: 

KPI « / ^^ff sin 2 (2Me 4r (I '-^^ • (27) 

However since the unlensed XBB signal is expected to be zero for squeezed shapes it is unlikely to be important to 
model this in the immediate future. 



C. Full sky 



So far we have only used the flat-sky approximation. This is very helpful for clarifying the relevant physics and 
keeping results simple, but in reality the observations that we have are of the full sky. Since squeezed triangles involve 
large-scale modes, it is therefore important to use a full spherical analysis. In Appendix [b| we discuss the equivalent 
of Eq. (||) , and show that as expected bispectra with different angular dependence remain orthogonal in the squeezed 
limit. A full analysis should then consider how these are lensed. 

Focussing on isotropic squeezed bispectra of most interest for primordial non-Gaussianity, we can simply use the 
understanding from the flat sky that the effect of lensing is just to lens the small-scale power, i.e. we apply the usual 
curved-sky CMB power spectrum lensing method || fjj to the reduced full-sky bispectrum 6™ ( , where I = (h + h)/2, 
separately for each value of l\. The computational cost of this method is then just N m x ^i imax times more expensive 
than lensing the power spectrum, and since the latter operation only costs a fraction of a second on a single CPU this 
is not problematic if only one or two angular moments (N m = 1,2) are required. By contrast the leading perturbative 
calculation [jj] is computationally very challenging, though it has the advantage of also being directly applicable to 
non-squeezed shapes. In detail we split the unlensed local bispectrum up into 6^ and bf^, and a small residual, lens 
the m = and m = 2 bispectra for li < 1000 using a modification of the standard method for lensing the full-sky 
temperature power spectrum || m], and then add back on the small (unlensed) residual part to obtain our estimate 
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of the full lcnscd bispcctrum. Our approximations are not valid for l\ 3> 500, and the high l\ angular moments 
become expensive to calculate, however most of the signal is at lower l\, so we only apply the lensing for l\ < 1000 
and approximate the (very small) contributions from higher l\ as being unlcnscd. It turns out that the m = 2 part of 
the bispectrum is rather smooth and not effected much by lensing, so lensing only the m = component is usually 
sufficient (Fig. |l|). 

Using our simple prescription for lensing the local non-Gaussianity we can then easily calculate various useful results 
to quantify the importance of lensing. Consider first noise-free temperature data to l max = 2000. We find that the 
lensed local bispcctrum is correlated at the > 0.999 level with the unlensed bispectrum, with the correction 8b to 
the bispectrum due to lensing only biasing estimators based on the unlensed shape by ~ 0.007/nl (Ref. 0] have 
previously shown the bias is very small). This confirms that the correction due to lensing is almost orthogonal to the 
original shape, which should not be surprising since lensing preserves total power (see discussion below). The change 
in shape due to lensing 8b is detectable at one sigma for /nl ~ 93. Including polarization data 8b would be detectable 
from lensing of the isotropic component alone for /nl ~ 22, but the bias remains small, ~ 0.01/nl- For Planck noise 
the current /nl limit is enough to rule out any chance of detecting the effect of lensing and we confirm the bias is 
negligible. 



III. COMPARISON WITH SIMULATIONS 

To test our new approximation we compared it with simulations. We generated 480 full-sky Healpix jy] maps at 
( mal[ = 2500,n S idc = 2048 with /nl = 100 following the method of Ref. Q. The unlensed input power spectra were 
taken from CAMB jl2| . The lensing potential power spectrum was then used to simulate uncorrelated lensing deflection 
angle maps, and the unlensed maps were then lensed using LensPix ^J. An estimator for the bispectrum in each 
full-sky noise-free realization is 

jijk _ ^2 ( h h h \ 

mi,m2,7Ti3 



B hl2h~ ^2 L, m9 TO q ) a hm 1 a i 2 m,2 a hm 3 ' ( 28 ) 



mi m,2 1TI3 

where i labels T, E or B, and we can relate to the reduced bispectrum &z 1 z 2 ; 3 defined using 



/ (2Z 1 + l)(2i 2 + l)(2Z 3 + l) (h h h\ , r9Q , 
Bi M3 = \J ^ Q Jb hhh . (29) 

To reduce the variance and therefore the number of simulations needed to average over, we follow Ref. jjj by subtracting 
a term that averages to give zero bispectrum, but removes much of the realization-dependent variance. We also took 
advantage of symmetries to reduce the number of sums, using: 

B'hlah = ^2 ^2 ( 2 ~ S ° m2 ) (mi m 2 m 3 )^ ^^i^^^^ms ~ ai imi a hm . 2 ai 3m3 } (30) 



1712—0 1713 = — I, 

where mi = —7712 — 013, and ai m and ai m are the lensed spherical harmonic coefficients from maps generated using 
the same random seeds but with /nl 7^ and /nl = respectively. Since we were running multiple simulations, 
and computational cost is nearly dominated by calculation of the 3j symbol, for each set of {I, m} we calculated the 
slice contributions from as many simulations as we could hold in memory. To avoid confusion with a lensing-induccd 
bispectrum, the lensing potential is generated with Cf^ = Cj = 0, so that the subtracted term involving a gives 
zero bispectrum on average. To test our bispcctrum estimation method wc compared unlensed simulated bispectra 
slices to the theory unlensed reduced bispectra generated by CAMB, with good agreement. 

As a first check on polarization bispectra, we used the publicly-available simulated non-Gaussian polarization maps 
from Ref. |Lq], which have a maximum multipole of 1024. 



1. Simulation results 



For the pure temperature bispectrum bJJ^ 3 simulations were run up to a maximum multipole of 2000. (Although the 
full-sky Healpix fil l maps were Z max = 2500, the lensing process requires a few hundred more multipoles than the scale 
you want to resolve.) The first acoustic peak of the CMB temperature power spectrum at / ~ 200 corresponds to the 
scale which has just had time to maximally compress or expand by the time of recombination. Therefore a bispcctrum 
with l\ — 10 corresponds to very large super-horizon modulations in the small-scale power; this modulation is roughly 
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FIG. 2: The fractional change due to lensing in the reduced bispectrum slices &io T i T ;+io an d ^ioTm 



result averaged over 480 realizations. The bjojj slice is somewhat noisier than the b^f^ 
black line is the theoretical approximation of this paper, which agrees well with the simulations to within sample variance. 



The red shows the simulated 
10 slice (see discussion in text). The 



constant through the surface of last scattering (the approximation of Eq. [16]). However a bispectrum with l\ = 200 will 
give a modulation in small-scale power that varies significantly through the thickness of the last-scattering surface. 
We have tested our lensing approximation for both the case of constant and non-constant modulation by simulating 
bispectra with both l\ = 10 and l\ = 200. 

Because of the closure condition, a bispectrum with equal small scale modes (h = h) looks like an isosceles triangle, 
where the large scale mode is roughly orthogonal to the short scale modes. For Is = li + I2 (eg. &io,n+io)) the closure 
condition demands that the triangle you draw has zero area and all the modes are aligned (parallel). We test if 
our lensing approximation holds for both orientations of modes by calculating bispectrum slices for both cases where 
possible. Since there is a significant quadrupolar m = 2 part of the bispectrum for l\ = 200, the slices for the different 
mode orientations are quite different, and our simulations test that the decomposition of the bispectrum into modes, 
and lensing just the monopole part, works consistently. For l\ = 10 the bispectrum is nearly isotropic, so the slices 
are very similar. 

Fig. y shows the fractional change due to lensing for the reduced bispectrum slices b^f l+w and b^Yi, averaged over 
480 simulated maps. In these plots the simulated fractional change has been normalized to remove variance from the 
large scale l\ modes: since l\ = 10 is a super-horizon mode, its modulation is roughly constant across last scattering 
and the bispectrum is roughly proportional to (Eq. fl6| ). Both slices agree very well with the simulations, but 

the fr^Tz slice has more sampling noise as discussed further below. 

Fig. |j shows the effect of lensing on the b^ 1 ] 1+200 suce 7 which also agrees with the theoretical calculation despite 
the more complicated sub-horizon form of the bispectrum. For larger l\ the bispectrum is no longer isotropic, so 
^2(xn z+200 an d &20cn ; differ significantly because of the large m = 2 quadrupole component. Contributions to the 
signal to noise arc roughly equal for every d<f> of angle between the long and the short modes; however since for 
squeezed shapes Z3 ~ l 2 + 1\ cos(<p) : there are more integer I3 per unit angle for orthogonal triangles (0 ~ 7r/2, l 2 ~ 1 3) 
than parallel (</> ~ 0, l 2 ~ h Hi)- The signal to noise per slice is therefore much less for an individual slice with I2 ~ 
than I2 ~ ±Zi, though the overall contribution to the signal is equally important for the different orientations 2 . The 
simulation sample variance on b^^i ; was too large to see the lensing effect on a single slice (the fractional effect of 
lensing is also smaller for triangle shapes to which the smooth m = 2 bispectrum component contributes significantly; 
c.f. Fig. 0). 

Polarization bispectra were estimated up to a maximum multipole of ~ 900. Fig. |4J shows the reduced bispectra 
for the polarization slices b^Q { E l+10 and b^^ l+1Q . which were also normalized to remove large scale variance. The 
agreement of the simulations with the theory is seen by eye, as well as the characteristic smoothing of peaks due to 
lensing. 



See also footnote in Appendix |b] 
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FIG. 3: Bispectrum slice &200 n+aoo- The left hand panel shows a comparison of the simulated vs theory lensed and unlensed 
bispectra, averaged over 480 realizations and smoothed over 10 /. The right panel gives the difference between the lensed and 
unlensed bispectrum for both simulations and theory, showing good agreement within the sample variance. 
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FIG. 4: The reduced bispectrum slices b 10 l l+10 and b 10ll+w averaged over 99 realizations and smoothed over 10 /. The 
black and cyan lines show the simulated and normalized unlensed and lensed bispectra. The dashed red line shows the normal 
theoretical unlensed bispectrum, and the dark blue dot-dashed line shows the lensed approximation of the bispectrum tested 
in this paper. 



2. Comparison with perturbative result 

In Fig. [5] we compare our new approximation to the result from the leading perturbative approximation of Rcfs. ^| 
for a squeezed bispectrum slice. The perturbative calculation captures the main effect, but the new approximation is 
more accurate on small scales; this is essentially exactly the same as the effect of higher-order correction on the power 
spectrum 0, since it is the power spectrum that enters the squeezed form of the local bispectrum (Eq. (|T^)). Our non- 
perturbative bispectrum approximation is easier to calculate since it only requires a standard CMB power spectrum 
lensing calculation for each l\ (only the isotropic part of the bispectrum needs to be lensed to good accuracy). We 
have only compared with one slice of the full leading perturbative result due to the high numerical cost of calculating 
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FIG. 5: The fractional change in the reduced bispectrum slice 6io,i,!+io due to lensing. The blue line shows the non-perturbative 
approximation of this paper, the black line shows the leading-order perturbative result from Ref. [jjj]. The red lines show the 
result of 1000 Monte Carlo simulations of Ref. jj]] smoothed over Al = 5. The new approximation only needs to lens the 
isotropic component of the bispectrum, and then is both significantly more accurate on small scales and faster to compute. 

the lensed spectra that way. 

IV. TRISPECTRUM 

In this section we consider the effect of lensing on local-model trispectrum estimation. The trispectrum depends on 
the wavevectors li, 1 2 , 13, 14 which define a quadrilateral. The local trispectrum is described by a sum of two shapes - 
denoted the g^L trispectrum and tml trispectrum respectively, i.e. 

(T(l 1 )T(l 2 )T(l 3 )T(U))c = (T(lx)T(l 2 )T(l 3 )T(l 4 ))f L + (T(h)T(h)T(h)T(h)}™ . (31) 

In the case of the former, the signal peaks for configurations with two wavenumbers much smaller than the others, 
e.g. l\, I2 <C Z3, 14, while in the latter case the signal peaks when one of the diagonals is small, e.g. |li + I2 1 , "=^C h- The 
Snl shape is independent of the diagonals of the quadrilateral. For simplicity we shall restrict our attention to the 
temperature trispectra, though in principle better constraints can be obtained making use of polarization information. 

The tnl shape can be generated with positive sign by scalar quadratic corrections to a Gaussian field; the g^L shape 
can be generated with either sign by cubic corrections. If generated by non-linear scalar corrections the squeezed 
shape dependence is isotropic: tnl gives a large-scale modulation of all small-scale mode power, independent of the 
small-scale mode orientation; (?nl looks like a large-scale modulation of a smaller-scale bispectrum j| . Lensing also 
generates a large trispectrum @, which is very blue compared to the local model because the modulation is due to 
a convergence (and shear) field, rather than a nearly scale-invariant primordial field. The lensing signal is also partly 
anisotropic due to the shear component; it is primarily a source of confusion for tnl since lensing does not generate a 
significant bispectrum except on the largest scales (because the small-scale lenses are uncorrelated to the small-scale 
anisotropics on the last-scattering surface). The average effect of lensing by many modes does however lead to a 
smoothing of any primordial trispectrum, similar to the effect wc showed in the bispectrum case. 

A general trispectrum may be expressed in terms of the five dimensional quantity p^*(L), where 

(T(1 1 )T(1 2 )T(1 3 )T(1 4 )) C = + 1 2 |) +1 3 |) +1 4 |)) ^(li + 1 2 + 1 3 + U). (32) 

The tnl signal peaks in highly squeezed shapes where L h,h,h, h- If L = 1 1 1 + h\ is small, |li + 1 3 | and |li + 1 4 | 
are typically much larger, and hence the trispectrum is well approximated by just the one pJ^(L) term, where L is 
the wavenumber of the modulation mode. 
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The <7atl shape is independent of the diagonals L of the quadrilaterals and may be expressed in terms of a four 
dimensional quantity t; 1 i 2 i 3 i 4 a s 



(T(1 1 )T(1 2 )T(1 3 )T(1 4 )} 



Ski 



1 



rthhhh^ih + h + h + U). 



(33) 



This equation may be written in the same form as Eq. ( |32[ ) by identifying U^i^ = + Pilit + Pilu with 



j 2 i 



A. tjvl Trispectrum 



The tnl shape is most easily understood as measuring the power in large-scale modulations of small-scale power; 
we will show how tnl can easily be estimated very accurately (and nearly optimally) by making use of modulation 
estimators. These estimators reconstruct the modulation field on the CMB for each L = l x + 1 2 , and the trispectrum 
is then measured by the modulation power spectrum for each value of L. Using this approach the bias due to lensing 
and the effect of lensing on the estimators is very easily calculated, and is essentially equivalent to using an accurate 
approximation for the full trispectrum. For completeness we will then relate to a more direct full calculation. Since 
the tnl trispectrum involves nearly full-sky modulation scales, in this section we will do a spherical analysis. 



1. Modulation estimators for tnl 



A trispectrum of tnl form is generated by a primordial modulation of the fluctuations, where the modulation 
is not necessarily correlated to the large-scale Gaussian fields. For example if the small-scale Gaussian curvature 
perturbation £o is modulated by another field <j> so that 

C(x) = Co(x)[l + 0(x)] (34) 

where <^(x) is a small large-scale modulating field. The large-scale modes of 4> can be measured by measuring the 
modulation it induces in the small-scale £ power spectrum. If <f> has a nearly scale-invariant spectrum, the nearly- 
white cosmic variance noise on the reconstruction dominates on small-scales, so only the very largest modes can be 
reconstructed (e.g. Ref. |L8| find almost all the signal in the CMB is at modulation multipoles I < 10; see Fig. ^ 
below). So a constraint on <fi is going to be limited to only very large-scale variations, in which case the scale of the 
variation is very large compared to the width of the last-scattering surface; i.e. in any particular direction a large- 
scale modulating field will modulate all perturbations through the last-scattering surface by approximately the same 
amount. A large-scale power modulation therefore translates directly into a large-scale modulation of the small-scale 
CMB temperature: 

T(n)«T s (n)[l + 0(n,r*)], (35) 

where T g are the usual small-scale Gaussian CMB temperature anisotropies and is the radial distance to the last- 
scattering surface. This is not quite right for the large-scale temperature due to e.g. the ISW effect, but there are 
relatively few large-scale modes and they have almost none of the signal to noise in the power modulation, so this 
error is harmless. But using this rather good approximation we can easily quantify the trispectrum as a function of 
modulation scale by using the power spectrum of the modulation, 

r NL (L)^^. (36) 

Here were normalized conventionally relative to C^* , the power spectrum of the primordial curvature perturbation 
at recombination (which we assume is known theoretically, but in practice cannot easily be measured at the relevant 
scale). For a scale-invariant primordial spectrum, = A s the angular (not CMB!) power spectrum is simply 

H£±2£-A.. (37, 

Note that tnl ~ 500 corresponds to an 0(1O -3 ) modulation. In Appendix <Q) we relate this to what you get applying 
equivalent approximations to the full form of the tnl trispectrum. 
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In the simplest local non-Gaussianity model with £(x) = Co( x ) + f /nl(Co( x ) 2 — (Co))i we can S P^ Co into long and 
short modes Q and £ s , then as far as observations of large-scale modulations are concerned we have 



C = C(l + |/NL[20 + G])+0(l + f/NLC/)-|/N L (Co) 



+ Cs l + 



6/] 



NL 



(38) 



The modulation model for the small-scale modes is then <f> = ^f^Q, and hence tnl(^) = (6/nl/5) 2 is scale-invariant. 
Any additional modulation field that is uncorrelated to £o will not change the bispectrum but will increase tnl(-£0> 
hence in general tnl(£) > (6/nl/5) 2 . 

Regarding the modulation field as fixed, the small-scale fields being modulated are Gaussian so we can write down 
a Gaussian likelihood function. We can then try to find the modulation field that maximizes this Gaussian likelihood, 
givingan estimator for the modulation field. The technical details of how to do this are described in more detail in 
Rcf . [|19| , with the result that there is a quadratic maximum likelihood estimator for the large-scale modulation field 
</> given by: 



(39) 



where h is a quadratic function of the filtered data that can be calculated quickly in real space: 



h? - - 
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(40) 



Here 



cov 



l T is the observed CMB sky after inverse- variance filtering (which accounts for sky cuts and inhomo- 



geneous noise), and C; 2 is the lensed C; (since the covariance in the lensed sky involves the lensed power spectra, and 
we neglect the effect of lensing modes correlated to 4>lm)- The normalization T is given by the Fisher matrix. In the 
simple full-sky case with isotropic noise this is 



Im.V ' 



$U'$rx 
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(2Zi + l)(2Z 2 + l) 
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(41) 



and the estimator noise is Nl = T 



LL ■ 



Here C z tot is the lensed spectrum plus any (isotropic) noise. For low L and 



high Z max the reconstruction noise is very nearly constant (white, because each small patch of sky gives a nearly- 
uncorrelated but noisy estimate of the small-scale power). We can then define an estimator of the modulation power 
spectrum 



L l ~ 2L 



1 



I0LA/I 2 



1 ^ 

M 



N 



(0) 



(42) 



where = {\4>lm\ 2 )o is a noise bias for zero signal (the same as Nl in the full-sky case). Additional 'W^ 1 '" 

(0(C*)) biases from cross-terms are expected to be small, and in any case vanish in the case of zero signal and hence 
would not give a spurious detection. On the cut sky pseudo-Cz estimators can be used fl9| , pp|| . 

For each value of the modulation scale L, Eq. (j42j) gives a separate estimator for t^l- In the ideal case we can 
combine them by inverse variance weig hting with var(Cf ) = 2iV£/(2Z + 1), giving the combined estimator 



_ J 2l var[f N L(£)] 1 tnl(L) 
EiVar^NL^)]- 1 



(2L 



T NL / j 

L 



2Nr 



(43) 



where (ct? nl ) 1 = J2l(^L + 1) / (2N l[C l *] 2 ) . Using the approximations C L cx (L(L + 1)) 1 . and the constancy of the 
white reconstruction noise Nl , we have 



T"NL 



in 



2L + 1 CI 
L 2 (L + l)*~cY 



(44) 
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If there is a scale- invariant signal so oc C L , we see that as expected the contributions fall rapidly oc 1/L 3 , as 
expected when measuring a scale-invariant signal that has large white noise. The total variance is 

9 T 2 AT 2 

Note that if the L = 1 dipolar modulation is excluded so that L m in = 2, the tnl variance becomes four times larger 
than if the dipole is included. We have 95% of the signal at L < 4, justifying the squeezed approximations used, 
and hence confirming that they should be good to percent-level accuracy. Assuming the only non-Gaussian signal 
is tnl, for a noise- free temperature-only experiment to Z max = 2000 Eq. (|4^) gives ov NL ~ 150 (in good agreement 
with |Q given parameter dependence; note that errors in |2(J are a factor of two larger because they exclude the 
dipole); for Planck cr TNL ~ 300. Using numerical values for gives somewhat more accurate results for non-scale 

invariant spectra, with o> NL = (J2 L var[/?NL(£)] -1 ) ~ ~ 134 for noise-free data to Z max = 2000, and ov NL ~ 200-300 
for Planck depending on assumptions. 

The approximation here essentially re- writes the tnl estimators of Ref. [^0| by analytically approximating the radial 
integrals, resulting in the CMB temperature power anisotropy estimators of Ref. JL9]]. The approximation consists of 
taking the recombination visibility as a delta- function compared to the modulation scale of interest, combined with 
neglect of small cross terms that complicate the relationship between the modulation power spectra and trispectrum 
(see Ref. |2l], |2^] for an extensive discussion in the context of CMB lensing where the latter corrections are more 
important on small scales). In exchange for these approximations we find estimators that are simple to interpret, fast 
to evaluate, can incorporate full inverse- variance weighting for optimality with real data, and allow other complicating 
effects such as lensing to be easily understood and modelled. 

The approximate estimator described here accounts for the average effect of small-scale lensing modes in a very 
simple way, since the only effect is to change the map filter functions so that they involve lensed rather than unlensed 
power spectra. This is consistent with the expression of the lensed trispectrum given in Appendix which only 
involves power spectra in an equivalent approximation: the large-scale primordial modulation modes cause a modu- 
lation in the fully non-linear observed small-scale power. The effect of the modulation on the lensing potential can be 
neglected since it only leads to an O(10~ 3 ) correction to the already-small lensing effect. 



2. Lensing bias 



The above analysis has assumed the only anisotropy was due to primordial modulation. There will also be lensing 
anisotropy due to large-scale lensing modes (and, for L = 1, Dopplcr modulation and angular abberation, as discussed 
further below). However as discussed in detail in Ref. p3| it is straightforward to account for multiple sources of 
anisotropy if required, or suboptimally by including lensing in the simulations that are used to subtract noise biases. 
Here we simply estimate the impact of large-scale lensing modes on a tnl estimator: i.e. if we neglect lensing, at 
what level is the simplest tnl estimator biased? 

Writing the quadratic estimator in harmonic space 



ii,mi,/ 2 ,«i2 



(2Z + l)(2Zi + l)(2Z 2 + l) /j i x i 2 \n l x l 2 
4-7T \0 / \m mi 7772 



(46) 



we can simply calculate the expectation due to other forms of anisotropy such as lensing. Averaging over small-scale 
Gaussian lensing and temperature modes (Im) ^ {hmi) for fixed convergence K/ imi we have (J, 24 
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(47) 



where we defined 



Mil 



(48) 
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FIG. 6: Contribution of different modulation scales L to tnl. The signal to noise falls off rapidly with L, ~ l/L 3 (solid line). 
The lensing convergence power spectrum is much bluer than scale invariant, and bias contributions to the tnl estimator from 
lensing are nearly flat in L; the bias is small if low L max is used, and saturates at around (tnl) ~ 100 if higher L max is used. 
There are also kinematic contributions to L — 1 that must be removed if L = 1 is used. 



As before all the CMB power spectra are the lensed power spectra. For the simplest full-sky case with isotropic noise 
©Zm = Ti m /C\ ot , so the mean field due to lensing is given from Eq. ( filf ) and is simply proportional to k: 



> . N L ^ (2Ji + l)(2Ja + 1) At h l 2 \ 2 (C h + C u )K LllU _ 
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(49) 



A positive convergence effectively shifts the CMB power spectrum to the left, decreasing the total power, so oil is 
negative, and numerically is typically 0(— 1). Neglecting some small cross-terms we can then estimate the bias on Cf 
due to large-scale lensing modulations by averaging over the lensing modes: 



and hence from Eq. (E3 



(tnl) ~ L 



OO 



2L + 1 a\Cl 



mm L^ ln i2 ( L + 1 ) 2 Ci 



(50) 



(51) 



(or a similar more accurate result using Eq. (p3[)). 

The scaling of the bias contribution with L is shown in Fig. [| in comparison with the contributions to the tnl 
signal to noise. The expected bias due to lensing depends on the range of L used. Almost all the tnl signal is at very 
low L, so a low i max < 10 includes almost all of the signal to noise. Using this, the Planck lensing bias is tnl ~ 17, 
or for noise-free data up to l m&x = 2000 it is tnl ~ 40. If the dipole is not included the bias is relatively larger since 
the expected modulation is scale-invariant but k is blue: for Planck with L > 2 the bias is t^l ~ 60. This is still only 
O(10%) of the size of the error bar, and hence is relatively much less important than for the bispectrum where the 
lensing signal can give /nl ^9 [0, f| . 

However the bias becomes quantitatively more important if a larger L max is used. For example using L max = 40, 



1 gives tnl ~ 80 for noise-free data, which increases to tnl ~ 300 if L„ 



2: a onc-sigma bias. The reason 



for this is that although the modulation estimator weighting falls rapidly with L, the lensing convergence power is 
sufficiently blue that its contribution to the bias only falls slowly. This clearly motivates using a lower L max for tnl 
analysis, where almost all of the signal is found but the convergence power is small: as we showed above, if L max ~ 10 
is used the lensing bias is then small enough to be only a small fraction of the error bar. 

However as in the case of the bispectrum, the expected bias can also easily be subtracted to obtain an unbiased 
estimator if required; the simplest method is just to include lensing contributions to in Eq. ( |42"| ) so that the 
estimator has zero expectation over realizations of lensed skies. If tnl is non-zero, then there will also be cross-terms 
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from universe-sized modes generating correlated low multipole signals in both the lensing potential and the primordial 
modulation. 

The Doppler dipole due to the earth's motion is ~ 10 -3 , and induces aberration and CMB temperature modulation 
at this level |25|-p7j|. If data are used to constrain tnl using the dipole modulation (which shrinks the error bar 
by a factor of two relative to starting at L = 2), the dipole- induced signal must be subtracted since its modulation 
reconstruction has signal-to-noise larger than one at Planck resolution. However we know the direction of the kinematic 
dipole in the CMB temperature, and hence both the magnitude and direction of the modulation that is expected; it 
can therefore be directly subtracted from the reconstructed dipole modulation field, and as long as this can be done 
to 0(10%) accuracy it should not be an obstacle to also detecting a primordial tnl that is detectable in the absence 
of kinematic effects. If the dipole is not used, the dominant primordial signal is then in the quadrupole, which may 
be susceptible to observational artefacts due to the quadrupolar dependence of Planck scan strategy, for example due 
to gain variations. Even in the absence of unexpectedly large primordial signals, tests for modulations can be a useful 
probe of instrumental systematics. 



3. Lensing bias from simulations and full tnl estimators 



To check consistency of the various approximations, we implement the trispectrum estimator as described in Ref. |2q ] 
and estimate the lensing bias from simulations. The estimator was tested using 300 Gaussian maps, which were then 
lensed using LensPix as before. 

The averaged full-sky estimator with isotropic noise is given by 

( £ ) = — EE — {L)hl)MgtlhL 
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where the angular brackets (...) refer to averaging over the simulations and the Gaunt integral is given by Q 7 ^ 
J dhYi imi Yi 2 m 2 Yi 37n3 . The denominator is given by the Fisher matrix, 
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(53) 



where {. . . } refer to the Wigncr 6j symbols, and P^(L) = Py^i^hi^Lhi^L, with h hhL = 
(2ii+i)(2i2+i)(2£+i) ^li h L\ ^ j n ^ e cage Q £ local model tnl trispectrum (given by jClj)), the squeezed shape 

dependence means that only P^ 2 (L) contributes significantly, and F may be efficiently and accurately calculated 
using H, 
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The optimal signal to noise of the trispectrum is given by F, and the corresponding variance of tnl is given by its 
inverse. For i max = 2000 the error bar is given by ct(tnl) = 129, where the sum over L has been calculated for 
1 < L < 100. We checked that the sum has converged to within ~ 2% of its final value by L — 5 and to within ~ 0.5% 
by L = 10. Furthermore, the replacement of Fl(tx,T2) by Fi,(r*,r*) = C£*; a s described in Appendix ^ results in 
little loss of accuracy, with the variance in tnl agreeing with the previous estimate to within 0(1%). Calculating the 
full trispectrum of Eq. (CI) however involves doing radial integrals, and the numerical error involved can be larger 
than the theoretical approximations made in the previous sections. The direct calculation of c(tnl) = 129 agrees 
quite well with the previous modulation-estimator value of ct(tnl) = 134, with a small portion of the difference being 
made up by 0(1%) theoretical approximations, and few percent numerical errors. 

Using Fl(ti, i'2) ~ Fl(t*, r*), the estimator for the amplitude of the local tnl model reads 
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where 



{A*B) LM = f dnY£ M {A) f drr 2 A(r,n)B(r,h), (56) 

A(r,n)=J2 ^B^M j (57) 

B{r,n)=2^ . ( 58 ) 

AB{hM)= [ drr 2 (a h (r)p l2 (r) + a l2 (r)P h (r)) , (59) 



and where C* = for unlensed Gaussian maps, but is replaced by the lensed angular power s pec trum for the 



lensed Gaussian maps (plus noise if applicable). Using the further accurate approximation of Eq. (C5), Eq. fl55[ ) is 



consistent with the previous modulation estimator of Eq. (43) for the low L of interest. In fact we have verified that 



use of this approximation in the estimator recovers the same variance and bias to 0(1 — 2%). 

The variance and mean of the estimator has been calculated over 300 Gaussian simulations giving a sampling bias 
of 3, and a one sigma error of 130. This error bar agrees quite well with the expected variance of 129. 

Applying the estimator up to L max = 10 to the lensed maps reveals a bias of 49 and a slightly enlarged variance 
of 141. Hence, application of the equation ( |52] ) verifies that large-scale lensing modes may introduce a small bias on 
tnl estimators of 0(30%) the size of the error bar for noise-free data out to l max = 2000. Extending the summation 
to L max = 100 increases the bias to 107, confirming the expectation that the bias would become quantitatively larger 
due to the blue spectrum of the lensing convergence. A realistic analysis on the lensed sky can restrict to a much 
lower value of i max to keep the bias small, with very little loss of tnl signal. 

We conclude that the modulation estimator is fully consistent with a more brute-force tnl estimator, and in fact 
that the rather accurate theoretical approximations (< 1%) involved are likely to be more accurate than numerical 
errors involved in the full estimator (which involves performing numerical radial integrals). Both are accurate to 
within a small fraction of the error bar, but the modulation estimator is significantly faster to calculate and makes it 
much simpler to calculate complications such as lensing bias. 



4- Lensing of the tnl trispectrum 



We have shown how to easily account for the main effects of lensing when estimating tnl , and quantified the bias 
due to overlap between the tnl and lensing trispectrum shapes. Here we briefly consider how this is consistent with 
the direct result for lensing of a primordial tnl trispectrum. 

The lensed tnl trispectrum is defined by 

(T (lx)f (1 2 )T (l 3 )f (1 4 )> C = (pllllQh + 1 2 |) +p\±(\h + h\) +P\±(\h + U|)) S(h + h + h + U). (60) 

In terms of the unlensed trispectrum wc have 

(f(h)f(h)f(h)f(h)} c =J UU (^5?) 6 (J2^J^2 [^f|(]li + I^I) + (2 perms)" 
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Using statistical homogeneity we may assume that the expectation value is independent of X)i x i- Defining I43 = 
(U — l3)/2, I21 = (I2 — li)/2, L = li + I2, r = (xi + X2 — X3 — X4J/2 this allows us to write 
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where we assumed a squeezed shape, L', L <C h, l[ and so only kept p\^(L). Also for squeezed shapes we can neglect 
the L' term in the exponential (as for the bispectrum), and we have 

(e^-^V 1 -^ = cxp ( - i((l 21 .a 2 i) 2 }) cxp ( - i((l 43 .a 4 3) 2 )) cxp ( - ((l' 21 .a 21 )(l^.a 4 3))) • (63) 
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The first two terms are exactly the terms that appear in bispectrum lensing (c.f. Eq. ([70), and hence are well 
approximated by power spectrum lensing of the small-scale power at wavenumbers I21 and Z43. The last term is an 
additional effect coming from the fact that the lenses acting on different pairs of small-scale modes are correlated (all 
small-scale modes see the same lensing potential realization). It depends on r, and hence in principle gives rise to 
couplings between L / L'; however we are interested in very low L, corresponding to a scale much larger than the 
arcminute lensing deflections, so wc can expect L « L' to a good approximation for all trispectrum components of 
interest. This is consistent because the difference in deflection angles at Xi,X2 and X3,X4 will only be very weakly 
correlated for large r separation of the pairs of points, so the third term in Eq. ( |63| ) can be neglected. 
This allows us to find the following estimate for the lenscd t^l trispectrum, 



x exp ( - i((l' 21 .a 2 i) 2 )) exp ( - \{{\' A ^f)) , (64) 

where the expectation values can be calculated exactly as for power spectrum and bispectrum lensing. Since the 
trispectrum is well approximated as involving only the CMB power spectrum on the small-scale modes (see Ap- 
pendix ^|), this is consistent with lensing simply replacing the unlensed power spectra by the lensed spectra. 

B. gNL trispectrum 

The c/nl trispectrum can be generated by local cubic corrections to a Gaussian field. It essentially measures the 
correlation of a small-scale local /nl with the large-scale field. Since local /nl itself has all the signal in squeezed 
shapes, this means that the signal in <7nl is dominated by shapes with l\,l2 <C Z3, Z4, though unlike in the tnl 
trispectrum the signal is not all in l\ <~ 1. For simplicity we will use the fiat-sky approximation here. 

We can approximate the primordial (?nl signal by taking the large-scale curvature modes with wavenumbers li and 
I2 to be constant through last scattering. For a primordial curvature perturbation £ = Co + .9nlCo the contribution to 
the trispectrum in this regime is then determined by 

(T(1 1 )T(1 2 )T(1 3 )T(1 4 )) C « C^Cl^ 5 J hrs J hr (T(1 3 )T(1 4)) 

6 



^g NL Cl^C^ * (C l3 + C u )5(h +l 2 + 1 3 + 1 4 ), (65) 



where C* is the primordial curvature perturbation at last scattering. This is a good approximation for li,l2 <C 100, 
but breaks down for the contributions where I2 > 100 where the finite thickness of last-scattering becomes important. 
However it is sufficient for ballpark estimates of the signal to noise and bias. 

1. Lensing bias 

With h, I2 < ^3, h, the <?NL-typc trispectrum due to lensing is of the form 

(f (ijf (h)f (ia)f (i 4 )> c « c£*c£* ( mi) ! mhy (rOa)r(U))) , (66) 

where tp is the lensing potential that gives the deflection angle a = V^. Using 

j c i-l'-(x+a) _ J_i . j/ e il'-(x+a) e -il-x fQy\ 

6^(1)* 2tt 
and evaluating to leading order we have 



(f (h)f (l 2 )f (l 3 )f (l 4 )) c « j^C^'C^' 



li • 1 4 1 2 ■ hCi 4 + li • I3 I2 • I3C/3 

ii • (h + h)h ■ (ii + i 3 )qi 1+ i 3 | - ii • (ii + U)i 2 • (li + h)c lll+ul 



«5(1 1+ 1 2 +1 3 + 1 4 ). (68) 
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Note that for l±, I2 <C h, h the two individually large pairs of terms nearly cancel. This leaves the total second-order 
response of the small-scale power being small, as expected. Numerically the Fisher signal to noise on this part of the 
lensing trispectrum for l max = 2000 gives S/N <C 1, and hence though much larger than likely primordial signals, is 
guaranteed to project onto (?nl at much less than the <?nl error bar (numerically we find a bias (<7nl) ~ 0(100) where 
the error bar is ct 9nl - O(10 5 ) @). 

In addition to Eq. ( pq) the lensing trispectrum |3(J 

p\f 2 (L) a -C^(li-LC4 1 +l 2 -IAi)(la-LC , , s +U-LC , u )+ perms (69) 

that contaminated tnl will also contaminate <?nl for smaller £1, Z2- F° r the configurations that contribute to <?nl at 
low ii,?2 the signal-to- noise in the lensing is however also below unity, and hence the projection onto <7nl is also 
guaranteed to be much smaller than the error bar. 

We conclude that lensing bias on <7nl, though much larger than likely primordial signals, is small enough compared 
to the error bar to be neglected. 



2. Lensing o/^nl 

F° r h ^ 500 we expect the unlensed short-leg approximation to be very accurate, and we now make this approxi- 
mation in both of the c/nl short legs labelled by l± and I2, with index ordering convention l\, I2 < ^3, 14: 



(f (h)f (1 2 )T (l 3 )f (U)) c » (T(h)T(h)f(h)f(U)) c 

d 2 x 3 d 2 x 4 d 2 l' 3 d% 



(2tt) 6 



(5(1! + \ 2 + I3 + Ii)<i 1 , al / { ;e < ( , «- ,8) - xa e < ^- u 5- x *(e <, «- as c iI i- a *). (70) 



The average effect of small-scale lensing is therefore very much like the squeezed bispectrum, with just the two short- 
scale modes being lensed, consistent with replacing unlensed with lensed power spectra in Eq. (|6^). An approximation 
for the lensed trispectrum therefore follows from corresponding approximations: 

fd 2 rd%o ( 1,.,, ,,A 

UihhU « J 2^-2^*«il a «i«i CX P ( J 43 ■ "43) > J . ( 71 ) 

which if desired could be evaluated in exactly the same way as for the bispectrum. However since observational 
constraints on <?nl are very weak, and we know that local /nl estimates are not much affected by lensing on average, 
modelling the effect in detail is probably unnecessary. 



V. CONCLUSION 



We have discussed two effects of lensing. Firstly, the presence of large-scale lensing modes klm modulate the small- 
scale CMB and hence partially mimic the effect of primordial modulations, giving rise to trispectrum and bispectrum 
signal that is potentially a source of confusion with primordial non-Gaussianity. The lensing effect is well-known to 
project onto the local non-Gaussianity model at the significant level of /nl ~ 9, and we have now also calculated 
the projection onto tnl; the signal is large compared to likely levels of primordial signal, however since the cosmic 
variance errors are large, the bias is only ~ 10% of the size of the error bar for Planck, and hence relatively not as 
important as for the bispectrum. For gjsih the effect is probably negligible. Secondly, the effect of many smaller-scale 
lensing modes changes the detailed shape of any primordial bispectrum and trispectrum, similar to the smoothing 
effect on the power spectrum. For squeezed shapes we showed how this effect can easily be calculated, and that it is 
well approximated as applying CMB power spectrum lensing to each slice of the m = isotropic part of the squeezed 
bispectrum. 

The potential bias on primordial signals due to the average change in the primordial shape due to CMB lensing 
is however very small. This is easily understood: since CMB lensing just moves points around, the nth moment of 
the temperature at any given point ([T(x)]™} is unchanged under lensing on average, for example the total power is 
conserved: 

(T(x) 2 )= /dm^ = /dln^- (72) 

Local non-Gaussianity looks like a modulation of small-scale power as a function of position. In the squeezed limit 
we are considering very large-scale modulations, and the simplest way to measure the modulation is just to calculate 
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the small-scale fluctuation variance as a function of position. However this variance is unchanged under lensing, and 
hence the non-Gaussianity estimate is unchanged under lensing on average, so it is unbiased. The total local skewness 
([T(x)] 3 } is also invariant under lensing on average, giving 3 



<T(x) 3 } 



dlfdl 2 



dlfdl 2 - 

(2tt) 4 lhh 



(73) 



In reality the picture is a bit more complicated because estimators weight by the inverse signal plus noise, so the 
estimators are not exactly unbiased, but nonetheless there is a good reason why the bias is expected to be small for 
local shapes. The detailed change in shape due to lensing could in principle be detected if /nl ^ 20, but the change is 
almost orthogonal to the unlenscd shape and hence neglecting it is usually harmless. On the other hand the bias due 
to correlation between primordial and lensing-induced non-Gaussianity should be subtracted or consistently modelled 
to avoid biases, especially for the bispectrum and the L = 1 dipole-modulation part of the tnl trispectrum. 

Lensing also affects the variance of non-Gaussianity estimators; this has been calculated in Rcf. jl|, and optimized 
bispectrum estimators accounting for this extra variance have been derived in Rcf. 0] . 
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Appendix A: General lensed temperature bispectrum 



For general shapes, all three temperature multipoles need to be lensed, giving 



(f (h)f (1 2 )T(1 3 )> = 



d 2 Xl d 2 x 2 d 2 x 3 d% d% d% 
~2tt 2tt 2tt 27T"27r~27r X 

(^(l^T^l^T^l' )e _ill ' Xl e _il2 ' X2 e _il3 ' X3 e^^ 



As usual we take a to be uncorrelated to T to isolate the effect of lensing on the primordial bispectrum. 
Let's define 1' = (1 2 - 10/2 = 1' 2 + 1^/2, a 2 = a 2 - <*i, a 3 = a 3 - ax. Then 

y 

l[ ■ £*i + 1 2 • a 2 + 1 3 • a 3 = 1' • (a 2 - a 3 ) - — • (a 2 + a 3 ), 



xi ■ (li - l'x) + x 2 • (1 2 - 1 2 ) +x 3 -(la-Is) 
and hence denning r = - (xi - ^±^) = (r31 + r2l) 



xi 



x 2 +x 3 



J hhh 



exp 



d 2 r 32 d 2 r d 2 l[ d 2 \ 
2~7r^ 2~7r~ ~27r 2~7r 

d 2 r 32 d 2 r d 2 V d 2 l[ 
2?r ~2t:~2 

7/2 

'1 
4 



b l 'i!i'e lr32 ' {r - l) e ir -^- h) 



exp 




).(li-li) + (x 2 -xs) •(!-!'), 



1' 

1' • (a 2 - as) - — ■ (a 2 + a 3 ) 



cr (r 2 i) +cos2^i ir31 C g i j2 (r2i) + a (r 3 i) + cos2^ ir . 31 Cgi i2 (r 3 i) 

1 



{v 2 (r 32 ) +cos20 ; / r32 C g i, 2 (r 32 )} 



(AI) 

(A2) 
(A3) 



^^ 6 ^^^ ,;r32 ' (1 '" 1)e "' (1 ' 1 ~ ll) CX P (4 i 1 ' 2 ( ff2 (^2) + cos2^ r32 C gl , 2 (r 32 ))]) x 



(A4) 



This is consistent with the exact bispectrum lensing result of Eq. ( |A4| ) because cr 2 (0) = Cgi^CO) 
part of the bispectrum contributes to the skewness. 



: 0. Note also that only the isotropic 
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This gives the general result of lcnsing of the bispectrum. For squeezed bispectra, to 0(l\/l 2 ) we can easily recover 
the result Eq. (|8|) from the unlensed short-leg approximation since 

d 2 r 32 d 2 v d 2 V d 2 l[ 



2?r 2tt 2 
d 2 r 32 d 2 V 
2?r 2tt 



^^ 1 ^3 e?r32 (1 '" 1)eir (1 ' 1 " ll) CX P ("5 (^2) + cos2^v 32 C gl>2 (r 32 ))] 
fthijise** 8 "^ 1 '- 15 exp (-1 [/' 2 (a 2 (r 32 ) + co S 2^ r32 Qi >2 (r3 2 ))]) (A5) 



Appendix B: Angular decomposition of squeezed triangles on the full sky 

On the flat sky a bispectrum triangle can be parameterized in terms of the wavenumbers of the large-scale and 
short-scale modes, and the angle between them. On the full sky we can change from l\, Z 2 , Z 3 to l\, L = (Z 2 + Z3) /2, and 
71/ = 1% — I2, where |M| < l\. For l\ even, L is an integer, corresponding to being able to divide the long wavenumbcr 
in two, and M is even; for odd l\, L is half-integer and M is only odd. On the full sky the reduced bispectrum is 
defined by 



4tt (h h Is 



b{h,L, M) - b hl2h - J - i)[2h - mh + i) ^ Q Q Q ] B hl2l3 , (Bl) 



where we note that this change of variable is related to the Regge symmetries [[31], [32| of the 3j symbols since 

(B2) 

Using l\, L, M we can replace sums over wavenumbers using 



h h h\ _ (h L L \ 
Oj \M -M/2 -M/2J- 



E = EE E (B3) 

i 1 ! 2 (3 ii L iW=— h 

and 

(2/i + l)(2/ a + 1)(2Z 3 + 1) = (2/i + 1)[(2X + l) 2 - M 2 ]. (B4) 

For squeezed triangles l\ <C |M| <C and to 0(l\/L 2 ) corrections in the flat sky limit 4 M = l\ cos(0« 1 ). 

The Fisher matrix gives the expected error on a bispectrum estimate, and the overlap function F(B,B') quantifies 
the bias when a bispectrum B is estimated and a different bispectrum shape B' is present. For squeezed triangles 
(with li -C 200, corresponding to the spacing of the acoustic peaks), C; 2 C/ 3 = C£ + 0{l\/L 2 \ so the overlap function 
is given by 

F(P R')- 1 V B ^ B 'hhh „ 1 y (2Z 1 + 1)(2L + 1) 2 L L \ 2 , ) ( y , . 

B ) - 6 E QiC|aCla - e E c^T 2 E [^M -M/2 -M/2) L ' M) ' (B5) 



The 3j term factorizes into terms involving M only through a combination of l\ and M 

2 /, -r t \2 /„ T j \ I //r 7 , , , ,„>|\ 2 



(B6) 



Whh\ = (h L L \'_ (2L-h)\ f ([L + h]/2)\ y (h - M)\(h + M)\ \ 

Oj \M -M/2 -M/2) {2L + h + l)\\{[L-h]/2)\) - M]/2)\([h + M}/2)\) 2 ) 

so we can define polynomials P m {M) in M that depend only on M and Zi such that 

E(m -M/2 -M/2) ^n(M)P m ,(A/)oc ( 5w (B7) 



More generally one can define cos</> = 2M/(2l\ + 1); as one approaches the flat-sky limit the angular dependence of the 3j symbol 

Q p „ J [. ..] tx J d<f>[. . .]; i.e. roughly that ( J <x l/sin(<£) so that the weight per M mode is much 

larger for sin0 ~ (i.e. \M\ ~ Zi) than for M ~ 0, reflecting the fact that there are more triangles with integer sides for near-isosceles 
than near- flat squeezed shapes at fixed L. 
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for i,j < l\. Expanding the bispectrum in terms of P m , this relation ensures that bispectra with different m compo- 
nents are orthogonal to 0(l\/L 2 ) (Eq. (B5) gives zero). Explicitly we can define P = 1 and 



l(P 2 (M) = 2M 2 - h{h + 1) itPi(M) = 8M 4 - 8{h - l){h + 2)M 2 + h(h + l)(h + 3)(h - 2), 



(B8) 



where the l\ factors on the LHS are for convenience so that in the flat sky limit P m (M) = cos(m0). Thus we define 
the angular moments of the bispectrum on the full sky by 



,J hL 



Em (m -M/2 -M/2J Pm ( M ) b hi2h 
Em (m _m/ 2 -M/2J P ™( M ) 2 



(B9) 



(if desired the normalization can be calculated analytically). 

As an example of an explicitly anisotropic bispectrum, the full-sky temperature lensing reduced bispectrum can be 
expanded in the squeezed limit (where most of the signal is) as 



1 



2 [h(h + 1) + h{h + 1) - h{h + l)]C//a 2 + perms. 
h(h + l)(Ci a + C h ) - M(2L + l)(C h - C h ) 



°«i 2 



Till 



l l {l l + l)C L + l -M 2 {2L + l) < ^- 



Tip 



2M 



r2 



h(h + l) 



- 1 



dC L 



_L d[L(L + l)C L ] 

dln(2L + l) L(L + 1) dln(2L+l) 



(BIO) 
(Bll) 

(B12) 
(B13) 



and hence has Pq(M) and P 2 (M) components, corresponding to the scalar and quadrupolar angular dependence 
expected from magnification and shear. Local non-Gaussianity also gives significant \m\ > 2 components on scales 
where l\ is sub- horizon at recombination (see Fig. |l|). 



Appendix C: Squeezed approximation for tnl CMB trispectrum 



Here we approximate the full expression for tnl and relate to the modulation estimator (c.f. Ref. 1 33 ) . The 
unlensed tnl reduced trispectrum is p8| , |33f 

p[^(L)=T N L / dr 1 dr 2 r2r 2 2 ^ L (r 1 ,r 2 )K(r 1 )/3 (2 (r 1 )+/3 il (r 1 )a i2 (r 1 )] [a h (r 2 )P u (r 2 ) + (3 h (r 2 )a h (r 2 )} (CI) 

where 

oj(r) = ^ J dlnfcj ; (fcr)^^ 

Mr) = 4^ J ' d]nkjiQer)Ai(k)V c (k) (C2) 

and 

F L (r u r 2 ) =4ttJ dlnkV c (k)j L (kn)j L (kr 2 ). (C3) 

We are interested in very low L where all the signal to noise is, so that L <C h,h,h, h- For small scales ai(r) and /3;(r) 
are sharply peaked at recombination, but Fl (r\ , r 2 ) varies only very slowly on the scale of the large-scale modulation 
modes. Therefore 

Z3Z4 1 



p\Hl(L)ttT NIu F L (n,n) / dnrHaMfain) + PUn^in)] / dr 2 r 2 2 {a h (r 2 )P h (r 2 ) + P h (r 2 )a u (r 2 )} . (C4) 
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Note that Ft, (r* , r* ) = C^* , i.e. the angular power spectrum of the primordial curvature perturbations at recombina- 
tion. Also since L is very small, much smaller than the scale of variation of the C;, l\ ~ Z 2 and so we have 



r 2 dr [a h (r)0 h (r) + h (r)a h (r)] ~ C h + C h . (C5) 

This is exact for l\ = I2, and accurate to sub-percent level for \l\ — Z2I < 10 that is relevant for tnl- Hence 

p\£(L) « TNLfi^.r.)^ + C Ia )(a, + C U ) « TNLCi^Q, + C la )(Ci 3 + C l4 ). (C6) 



This agrees with the approximation and definition of Eqs. (J36|) (35) in the main text. For L — 1 the relevant scales 
controlling the approximations are the thickness of the last-scattering surface (~ lOOMpc) and the distance to the 
last-scattering surface r* ~ 14000Mpc, and hence approximations are expected to be accurate at the percent level 
for high h,l2,h,h (3> 100). The approximate effect of lensing is also clear: the small-scale temperature modes 
contribute to the trispectrum via their power spectrum, and averaging over small-scale lensing modes simply changes 
the unlensed power spectra to the lensed power spectra. 
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